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Abstract 

M ost of the laboratory studies on recasts have examined the role of intensive 
recasts provided repeatedly on the same target structure. This is different 
from the original definition of recasts as the reformulation of learner errors 
as they occur naturally and spontaneously in the course of communicative 
interaction. Using a within-group research design and a new testing method¬ 
ology (video-based stimulated correction posttest), this laboratory study ex¬ 
amined whether extensive and spontaneous recasts provided during small- 
group work were beneficial to adult L2 learners. Participants were 26 ESL 
learners, who were divided into seven small groups (3-5 students per group), 
and each group participated in an oral activity with a teacher. During the ac¬ 
tivity, the students received incidental and extensive recasts to half of their 
errors; the other half of their errors received no feedback. Students’ ability to 
detect and correct their errors in the three types of episodes was assessed 
using two types of tests: a stimulated correction test (a video-based computer 
test) and a written test. Students' reaction time on the error detection por¬ 
tion of the stimulated correction task was also measured. The results showed 
that students were able to detect more errors in error-frecast (error followed 
by the provision of a recast) episodes than in error-recast (error and no recast 
provided) episodes (though this difference did not reach statistical significance). 
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They were also able to successfully and partially successfully correct more errors 
in error-H-ecast episodes than in error-recast episodes, and this difference was 
statistically significant on the written test. The reaction time results also point 
towards a benefit from recasts, as students were able to complete the task 
(slightly) more quickly for error-H-ecast episodes than for error-recast episodes. 

Keywords : error detection; error correction; recast; corrective feedback; 
grammaticality 


1. Introduction and literature review 

Within the field of second language acquisition (SLA) research, corrective feed¬ 
back, a response by a teacher or other interlocutor that attempts to signal to a 
nonnative speaker (NNS) the incorrectness/ungrammaticality of the NNS's ut¬ 
terance, has received a great deal of attention in recent years. Among corrective 
feedback types, recasts have received the most attention from researchers. 
Early observation classroom studies have found that recasts occur more often 
than any other type of corrective feedback in natural L2 classrooms; this obser¬ 
vation was found in ESL classrooms (Havranek, 1999; Panova & Lyster, 2002), 
French immersion classrooms (Lyster, 1998; Lyster & Ranta, 1997), a German L2 
classroom (Lochtman, 2002), and in NS-NNSdyadic interaction (Iwashita, 2003; 
Nassaji, 2007, 2009). Recasts are generally considered to be a form of implicit 
correction in that they do not interrupt the flow of meaningful interaction (see 
Doughty & Varela, 1998; Goo & Mackey, 2013; Nicholas, Lightbown, & Spada, 
2001), and teachers may use recasts because they do not slow down the flow 
of communication as much as some other types of corrective feedback. 

Theoretically, recasts have been assumed to be beneficial for language 
learning because they provide positive evidence: They supply the learner with 
the correct form (Ellis & Sheen, 2006; Nicholas et al., 2001). It has also been 
claimed that recasts provide negative evidence, which has been considered nec¬ 
essary for adults L2 learners. Recasts may also increase the perceptual saliency 
of the target form as the juxtaposition of the learner's utterance and the 
teacher's recast highlight the error (Farrar, 1990; Saxton, 1997). Finally, it has 
been proposed that recasts promote interaction. Long's interaction hypothesis 
states that in addition to input, participation in interaction is needed for second 
language learning to occur (Long, 1981,1983,1996). Closely related to the idea 
of interaction is Swain's (1985) concept of pushed output, which proposes that 
learners must be pushed to produce modified output in order for L2 learning to 
occur; recasts are proposed to push learners to change their output. 
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Given the theoretical benefits for recasts, many studies have examined 
the beneficial effects of recasts (Ammar & Spada, 2006; Dilans, 2010; Ellis, Loe- 
wen, & Erlam, 2006; Iwashita, 2003; Loewen & Philp, 2006; Lyster, 2004; Lyster 
& Ranta, 1997; Lyster, Saito, & Sato, 2012; Mackey & Philp, 1998; Nabei & 
Swain, 2002; Nasssaji, 2007, 2009; Panova & Lyster, 2002; Saito, 2013 among 
others). These studies, the results of which have also been summarised in a 
number of recent reviews and meta-analyses (e.g., Li, 2010; Lyster & Saito, 
2010; Lyster, Saito & Sato, 2012; Nassaji, 2015; Sheen, 2011), have reported pos¬ 
itive effects for recasts in general. However, most of such studies have provided 
recasts intensively and repeatedly on the same target structure, which can be 
different from the definition of recasts as reformulation of the learner errors as 
they occur incidentally in the course of interaction. Of course, some early stud¬ 
ies on recasts were conducted in natural second language classrooms and were 
observational in nature (Ellis, Basturkmen, & Loewen, 2001; Lochtman, 2002; 
Lyster, 1998; Lyster & Ranta, 1997; Panova & Lyster, 2002). However, these 
studies used uptake, with uptake being defined as a learner's production of the 
correct form following the corrective feedback, to see how effective the various 
types of corrective feedback were. The results of these studies showed that re¬ 
casts might not be as effective as other types of feedback as they tended to 
produce less uptake than, for example, clarification requests. However, despite 
such studies, a number of researchers have questioned whether uptake and 
other measures of noticing can be used as valid measures of the beneficial role 
of recasts (see Loewen, 2005; Loewen & Philp, 2006; Mackey & Philp, 1998; 
Nicholas et al., 2001). The use of uptake is perhaps most often linked to 
Schmidt's noticing hypothesis (Schmidt, 1990, 1995), which proposed that at¬ 
tention is necessary for language learning to occur. However, as Mackey and 
Philp (1998) point out, while uptake can signal the presence of noticing, no up¬ 
take does not necessarily indicate the absence of noticing. 

In the movement awayfrom uptake as the sole measure of the effective¬ 
ness of recasts, a number of researchers turned to pre-/posttest designs. With 
the introduction of pre-/posttests also came the increased use of preselected 
target linguistic forms. The need for such targets is clear: It is nearly impossible 
to pretest linguistic forms when you do not know on which forms the learner 
will produce errors and receive spontaneous recasts. Table 1 summarizes a 
number of pre-/posttest studies that have examined recasts. 
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Table 1 Summary of a sample of pre-/posttest recast studies 

Study 

M easure 

Feedback types examined 

Context 

Preselected 

Intensive 


of L2 learning 



target form 

extensive 

M ackey 

Pre-/posttests 

Recasts 

Dyads 

English 

Intensive 

& Philp (1998) 




questions 


McDonough & 

Pre-/posttests 

Recasts 

Dyads 

English 

Intensive 

Mackey (2006) 

and uptake 



questions 


Iwashita (2003) 

Pre-/posttests 

Naturallyarising* 

Dyads 

Japansese locative-ini¬ 
tial construction & te- 
verb 

Extensive 

Leeman (2003) 

Pre-/posttests 

Recasts vs. negative evi¬ 

Dyads 

Spanish noun-adjec¬ 

Intensive 



dence vs. enhanced sali- 
ency 


tive agreement 


Han (2002) 

Pre-/posttests 

Recasts 

Small group 

Tense consistency 

Intensive 

Lyster (2004) 

Pre-/ posttests 

Recasts vs. prompts 

Classroom 

French noun gender 

Intensive 

Ammar 

Pre-/posttests 

Recasts vs. prompts 

Classroom 

3rd person singular 

Intensive 

& Spada(2006) 




determiners 


Ellis, Loewen, 

Pre-/posttests 

Recasts vs. metalinguistic 

Classroom 

Past tense -ed 

Intensive 

& Erlam (2006) 


feedback 




Ishida (2004) 

Pre-/posttests 

Recasts 

Dyads 

Japanese-te /-(ru) 

Extensive 

Nassaji (2006) 

Pre-/ posttests 

Recasts vs. elicitations 

Dyads 


Extensive 

Loewen 

Posttests 

Naturallyarising* with a 

Classroom 


Extensive 

& Philp (2006) 

and uptake 

focus on reporting recasts 





* Note. In studies labeled as examining natura//yaris/'ng feedback, no typesof feedback were preselected 
for study and the researchers examined all feedback types that arose naturally during the interaction, 


While the above studies have found some benefits for recasts, most of 
them employed intensive recasts directed at preselected target linguistic forms. 
Therefore, it is not clear whether their findings were due to the recasts them¬ 
selves or the intensity of the feedback. A few of those studies have examined 
the effects of extensive recasts. Ishida (2004) attempted to lessen the unnatural 
situation of providing intensive recasts by providing recasts "whenever [the re¬ 
searcher] felt the need to confirm the meaning of the message" (p. 340). While 
caution should be employed due to the small number of students involved (N = 
4), the results of Ishida's study showed a positive correlation between the num¬ 
ber of recasts a learner received and their accurate use of -te i-(ru). Nassaji 
(2006) went one step further than Ishida by examining truly extensive recasts 
using an innovative pre-/posttest design with no preselected target form. Each 
of 42 learners was asked to write a description based on a series of pictures; the 
descriptions were then collected and the NNS was asked to orally describe the 
story. During this oral interaction, corrective feedback was provided whenever 
the NS felt it was appropriate. After the oral interaction, the written story was 
returned to the student and he/she was asked to make corrections. A similar 
delayed posttest was given two weeks later. Results showed that learners suc¬ 
cessfully corrected more of the errors that had received recasts than the errors 
that had received elicitations. Loewen (2005) conducted a classroom study on 
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the effectiveness of spontaneous focus-on-form (both reactive and preemptive) 
on L2 learning. Although his study did not single out recasts for examination, 
the data was reexamined in Loewen and Philp (2006) with a close focus on re¬ 
casts. Individualized immediate and delayed posttests were created for each 
student based on focus-on-form episodes (FFEs) that had occurred in the class¬ 
room interaction. The posttests consisted of oral suppliance, correction, and 
pronunciation tasks. As the feedback was spontaneous and the researchers did 
not know ahead of time which forms would receive feedback, no pretests could 
be administered. Instead, the initial errors made by the students served as a 
type of pretest (i.e., if the student made the error, it indicated that their 
knowledge of that form was to some degree incomplete). 

The above findings suggest that recasts, even when provided incidentally 
and extensively, may be beneficial to L2 students, both in dyadic interaction and 
in classroom interaction. However, more research is needed in this area, espe¬ 
cially in the area of nondyadic interaction. In addition, there are some method¬ 
ological concerns with these studies. First, there is concern over the naturalness 
of dyadic studies. In dyadic studies, learners receive undivided individual atten¬ 
tion from a NS over an extended period of time, which is not the norm in a typ¬ 
ical L2 classroom. As such, it is possible that students pay more attention in dy¬ 
adic interaction. Ellis and Sheen (2006), Lyster (2004), and Nicholas, Lightbown, 
and Spada (2001) have pointed out that dyadic interaction may produce differ¬ 
ent results than larger-group interaction. Second, there isthe concern that stud¬ 
ies that use preselected target linguistic forms and/or intensive recasts may in¬ 
crease learners' attention to recasts and the forms targeted by the recasts more 
than natural L2 classrooms (see Ellis & Sheen, 2006; Nicholas et al., 2001). Since 
the pretest, tasks during treatment sessions, and posttests were all designed to 
elicit the target form, the students may have picked up on this and directed 
their attention to that form. Third, there is the concern that studies that exam¬ 
ine the effectiveness of providing intensive recasts in response to errors in the 
target form while ignoring all other types of errors may have little value when 
discussing the effectiveness of recasts in real L2 classrooms, which are generally 
provided extensively. Examples of recasts from observational studies have 
shown that recasts provided in L2 classrooms are generally provided in re¬ 
sponse to a wide range of errors (Loewen & Philp, 2006; Nassaji & Hawkes, 
2006; Sabbagh, 1998; Sheen, 2006). Thus, as Ellis and Sheen (2006) pointed out, 
"the claim that recasts are most effective when they are focused and intensive 
(i.e., directed repeatedly at a single linguistic feature) is of little practical signif¬ 
icance to teachers" (p. 597). 

An additional problem that needs to be overcome in the use of posttests 
when examining spontaneous, extensive recasts is the matter of to what the 
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results of the posttest should be compared. Since it is not possible to give a 
pretest when you do not know what forms students will receive recasts on, how 
can we know whether the accuracy on posttests reveals something? We need 
to find something meaningful with which to compare it before we can make any 
claims about the beneficial role of recasts. 

The above review has demonstrated that there are unresolved issues with 
previous studies surrounding varying results and methodological concerns. The 
concerns with previous recast studies are neither few nor trivial, and they make 
a strong call for further study of recasts, and in particular for new methodology 
in the study of extensive recasts. The purpose of the present study is to examine 
the effectiveness of spontaneous, extensive recasts in small-group interaction 
by employing a new testing methodology: stimulated correction. The stimulated 
correction testing method was designed to have the following advantages over 
previous measures of the effectiveness of recasts: 

• It is a timed, oral test (both stimuli and response); this is designed to put 
the student in a situation similar to the situation in which they initially 
made the error. 

• The context of the errors is provided during the testing phase. 

• Students view and listen to their errors exactly as they made them dur¬ 
ing the initial interaction, not spoken by a NS examiner. 

• It allows for comparisons between each student's performance on those 
errors that received recasts and those errors that did not receive recasts. 

• It measures students' ability to detect and correct errors separately; this 
avoids the "all or nothing" view of effectiveness in L2 learning and pro¬ 
vides more fine-tuned testing. 

• It allows for the measure of reaction times when students are detecting 
errors in video clips of their speech. 

The study addressed the following research questions: 

1. Are errors that receive recasts detected more often than errors that do 
not receive recasts? 

2. Are errors that receive recasts corrected more often than errors that do 
not receive recasts? 

3. Are there any differences in students' reaction times in detecting errors 
for episodes that receive recasts and those that do not? 

4. Does the type of test make a difference in the results (i.e., stimulated 
correction vs. written test)? 
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2. Method 

The current study employed an experimental within-subject research design. 
Small groups of adult ESL students participated in an oral task with a teacher 
(the researcher) in which some of each student's errors received recasts and 
some of their errors received no feedback (were ignored). 

2.1. Participants 

In total, 26 adult ESL students participated in the study; three of the students did not 
complete day two of the study and their data were excluded from the analysis. The 
students were all enrolled in intermediate-level classes at the English Language Cen¬ 
tre at the U niversity of Victoria, Canada at the time of the study. The 23 students who 
completed the study had a mean age of 25.2 years, had been living in Canada for an 
average of 4.2 months at the time of the study, and came from seven different LI 
backgrounds. Students were assigned to one of seven small groups on a first-come, 
first-serve basis, and the number of students in each group ranged from three to five 
students. The use of small groups in the present study was designed to mi mic the way 
in which a small group would operate within a classroom. In this way, the present 
study could avoid the individualized attention that students receive in dyadic studies. 

2.2. Procedure 

Data collection took place over two days for each group. On day one, the small 
group of students met with the researcher in a small classroom. The students 
filled out a background questionnaire and then participated in an oral small-group 
activity, which was captured using a digital video camera and an external micro¬ 
phone. The small-group activity consisted of one task involving two parts: jigsaw 
(see Crookes & Gass, 1993; Johnson, 1981) and decision making (see Crookes & 
Gass, 1993; Doughty& Pica, 1986). The average time the groupstookto complete 
the task was 38 minutes. During the activity, the researcher took on the role of 
the teacher and provided recasts following roughly half of each student's errone¬ 
ous utterances. The remaining errors received no feedback (were ignored). In to¬ 
tal, there were 141 erroneous utterances that received recasts and 150 errone¬ 
ous utterances that received no feedback, leading to a ratio of 1.06 to 1,00. 

The recasts in the present study were provided extensively. The recasts pro¬ 
vided during the interaction had the following characteristics: (a) immediately fol¬ 
lowed a student's erroneous utterance, (b) repeated all or part of the student's utter¬ 
ance while reformulating the error(s), (c) did not change or add any information to 
the learner'starget-like portion of the utterance, and (d)employed a rising intonation. 
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Some of these characteristics are illustrated in the following example, which uses (as 
all the other examples) the transcription conventions provided in the appendix: 

Example 1 

SI: They should look the your eye. 

T: Look you in the eye? 

SI: In the eye. 

Each student was tested both orally (stimulated correction task) and in 
written form (written test). In order to create the tests, three types of episodes 
were identified in the video recordings: error+recast, error-recast, and correct. 
Error-recast episodes involved exchanges in which a student made an error and 
then received a recast; in error-recast episodes, the student produced an erro¬ 
neous utterance but did not receive any feedback; correct episodes were those 
in which the student produced a grammatically correct utterance. 

In total, 402 episodes were identified. 333 of these episodes (111 of each 
type) were randomly selected to serve as testing episodes and 69 episodes (3 per 
student) were selected to serve as practice episodes. The episodes were then ed¬ 
ited using iM ovie to create the short video clips, as in Examples 2,3 and 4 below. 
The clips were edited to end with a student's utterance, and the student was re¬ 
quired to judge the grammaticality of their final utterance in the stimulated cor¬ 
rection task. In Examples 2, 3 and 4, the boxes around the utterances indicate 
which utterances were included in the edited clips; the symbol indicates an au¬ 
ditory cue, a "beep" that was inserted to direct the student's attention to the 
utterance of which they would be required to judge the grammaticality. 

Example 2: An error-recast episode 
T: Well, what do you guys think? 

SI: I think if, if, uh Lisa can speak another language, she understands another 
student things £ so she can teach very well that country students. 

T: Students from that country? 

SI: Yeah. 


Example 3: An error-recast episode 

S2: Yeah, I think so. _ 

T: You all agree! For the same reasons or different ones? 

SI: Uh, I think uh Philip is more academic than Lisa, but he has enough- he te- 
j3 he didn't have enough experience to teach to the student. 

T: M m hmm. 

SI: So just uh he tea- he taught reading and writing- 
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Example 4: A correct episode 

T: M m hmm. Okay. So whose work experience do you like the best? Do you think? 

S4:1 think Lisa. 

T: Okay, why? 

S4: j3 Because she has uh lots of experience teaching ESL. 

Because three years she teach English and Lisa also uh it is relative in researching. 

T: M m hmm. 

The day after the small-group interaction, each student completed first 
the stimulated correction task and then the written test individually in a quiet 
room. For the stimulated correction task, which was presented using SuperLab 
4.0, the student was seated in front of a computer with headphones and a mi¬ 
crophone. The procedure was as follows: (1) a screen with the words "The video 
will begin in 3 seconds," which was displayed for three seconds, (2) the playing 
of a video clip, (3) a screen with the words "Press ERROR or NO ERROR," which 
was displayed until the student pressed either the "ERROR" key or the "NO ER¬ 
ROR" key on the keyboard, which students were instructed to do as quickly as 
possible, and optionally (4) a screen with the words "Now correct it," which was 
displayed for 8 seconds and prompted the student to orally correct their error(s) 
(Screen 4 would appear only if the student had pressed the error key). Each 
student was tested only on those clips which involved their own utterances. 
Each student viewed an average of 14.5 video clips (the range was from 7 to 
21). Reaction time was measured between the end of the video clip and the 
time at which the student pressed the error or the no-error key. 

A written test was also created for each student. Each test question con¬ 
sisted of one written sentence (the utterance after the ja in the video clips). Stu¬ 
dents were instructed to read each sentence, indicate if there were any errors, 
and correct any errors. Sentences were presented in random order and students 
were allowed as much time as they needed to complete the written test. 

3. Data analysis 

The coding of the error correction responses involved comparing the student's 
utterance in the interaction with the student's modification of their utterance 
during the stimulated correction task and the written test. Students' modifica¬ 
tions were coded as successful modification (the student corrected all the er- 
ror(s)), partially successful modification (the student corrected only one/some of 
the error(s)), unsuccessful modification (the student failed to correct any errors), 
or no modification (no attempt to correct). For correct episodes only, the coding 
successful meant that the student had changed the (already correct) utterance 
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in such a way that it could also be considered grammatically correct and unsuc¬ 
cessful meant the student changed the correct utterance into an incorrect one. 

Student modification of error-frecast episodes were further coded to allow a 
more detailed analysis of how recasts may have influenced students' ability to mod¬ 
ify their errors: Successful modifications were subcoded as either successful/same 
as recast or successful/ different than recast. Partially successful modifications were 
subcoded as either partial/same as recast or partial/different than recast. 

4. Results 

4.1. Error detection results 

As shown in Table 2, on the error detection task students indicated there were 
errors in error-frecast and error-recast episodes significantly more often than in 
correct episodes on both the stimulated correction task and the written test, 
which was expected. Encouragingly, students detected the majority of the er¬ 
rors in their speech. On both the stimulated correction task and the written test, 
there was a trend for students to detect more errors on the error-frecast epi¬ 
sodes than on the error-recast episodes. On the stimulated correction task, stu¬ 
dents detected 76.6% of the errors in the error-frecast episodes versus 71.2% in 
the error-recast episodes. However, chi-square analysis showed that the differ¬ 
ence did not reach statistical significance [x 2 (1, N =222) =.840, p =.359], On the 
written test, students detected 83.8% of the errors on the error-frecast episodes 
and 82.0% of the errors on the error-recast episodes. A chi-square analysis found 
that the difference was not significant [x 2 (1, N =222) =,127, p =.122]. 


Table 2 Students' error detection responses 



Episode type 

Errors detected 

Errors not detected 


n 

% 

n 

% 


Error-frecast 

85 

76.6 

26 

23.4 

Stimulated 

Error-recast 

79 

71.2 

32 

28.8 

correction 

Correct 

53 

47,7 

58 

52.3 

task 

Total 

217 

65.2 

116 

34.8 





X 2 (2,N =333) =15.10, p=.001 


Error-frecast 

93 

83,8 

18 

16.2 


Error-recast 

91 

82,0 

20 

18.0 

W ritten test 

Correct 

59 

53.2 

52 

46.8 


Total 

184 

82.9 

38 

17.1 





X 2 (2, N =333) =46.636, p =.000 
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4.2. Error correction results: Stimulated correction task 


As shown in Table 3, when the modification patterns for error-recast and error- 
recast episodes on the stimulated correction task were compared, the results 
showed a trend for a greater percentage of successful modifications on er¬ 
ror-recast episodes (25.9%) than of error-recast episodes (17.7%); the same 
trend, but to a lesser degree, was found for partially successful modifications. 
Students also made more modification attempts for error-recast episodes 
(85.9%) than error-recast episodes (78.5%), suggesting that the recasts may 
have assisted them with some of their modifications. Although the differences 
in modification patterns in Table 3 show a benefit from recasts, a chi-square 
analysis found that this difference between error-recast and error-recast epi¬ 
sodes was not significant [y 2 (3, N =164) =2.681, p =.443]. 


Table 3 Error modification patterns (stimulated correction task) 



Total 

Successful 


Partial 

Unsuccessful 

Not modified 

n 

% 

n 

% 

n 

% 

n 

% 

Error-frecast 

85 

22 

25.9 

19 

22.4 

32 

37.6 

12 

14.1 

Error-recast 

79 

14 

17.7 

16 

20.3 

32 

40.5 

17 

21.5 

Correct 

53 

21 

39,6 

NA 

NA 

21 

39.6 

11 

20,8 

Total 

217 

57 

26.3 

35 

16.1 

85 

39.2 

40 

18.4 







X 2 (6, N =217) 

= 18.73,p 

= .005 


Note. Percentages represent percentage of those episodes in which errors were detected. 


Students' successful and partially successful modifications for error-re¬ 
cast episodes were also examined to see if the utterances were modified in the 
same way as the teacher-provided recast had modified their utterance. As Table 
4 shows, when students successfully modified their errors, they were much 
more likely to do so in the same way as the recast had done (81.9%) than in a 
different way than the recast had done (18.1%). This difference wasfound to be 
significant (y 2 [1, N =41] =4.011, p =.045). 


Table 4 Same as recast vs. different than recast successful and partially success¬ 
ful modifications on error-recast episodes (stimulated correction task) 



Same as recast 

Different than recast 


N 

% 

N 

% 

Successful 

18 

81.9 

4 

18.1 

Partially successful 

10 

52,6 

9 

47.4 

Total 

28 

68.3 

13 

31.7 




X 2 (1, A/ =41) =4,011, p = .045 
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4.3. Error correction results: Written test 

The error modification results for the written test are presented in Table 5. 
When the results of the three types of episodes were compared, a chi-square 
found a significant difference [y 2 (6, N =333) =38.415, p =.000]. The most no¬ 
ticeable difference between the modification patterns was found in successful 
modifications, with correct episodes resulting in significantly more successful 
modifications than error+recast and error-recast episodes. 


Table 5 Error modification patterns (written test) 



Total 

Successful 


Partial 

Unsuccessful 

Not modified 

n 

% 

n 

% 

n 

% 

3 3.3 

Error-frecast 

93 

34 

36,6 

33 

35.5 

23 

24.7 

3 3.2 

Error-recast 

91 

21 

23.1 

29 

31.9 

38 

41.8 

1 1.7 

Correct 

59 

36 

61.0 

NA 

NA 

22 

37.3 

7 2.9 

Total 

243 

91 

37.4 

62 

25.5 

83 

34.2 

3 3.3 







X 2 (6, N =333) 

= 38.415, p =.000 


Note. Percentages represent percentage of those episodes in which errors were detected. 


As for the comparison between the results of the error-H-ecast and error- 
recast episodes, it was found that students successfully modified more of their 
errors from error-H-ecast episodes (36.6%) than from error-recast episodes 
(23.1%). They also partially successfully modified more errors from error-H-ecast 
episodes (35.5%) than from error-recast episodes (31.9%). As for unsuccessful 
modifications, error-H-ecast episodes led to far fewer (24.7%) of these than error- 
recast episodes (41.8%). The percentage of detected errors left unmodified was 
similar for error-H-ecast and error-recast episodes (3.3% and 3.2% respectively). 
Thus, the error modification results for error-H-ecast and error-recast episodes 
show that students performed better on the error-H-ecast episodes than on the 
error-recast episodes, but when a chi-square was performed to see if the differ¬ 
ences in modification patterns between error-H-ecast and error-recast episodes 
were significant, it was found that they were not [y 2 (3, N =222) =6.998, p =.072], 
As with the results from the stimulated correction task, the modification 
patterns of error-H-ecast and error-recast episodes on the written test were ex¬ 
amined in more detail. Table 6 presents the results of combining successful with 
partially successful modifications and unsuccessful with not modified responses. 
As can be seen, error-H-ecast episodes resulted in many more successful and par¬ 
tially successful modifications than error-recast episodes (72.0% vs. 54.9%), and 
this difference was found to be significant [y 2 (1, N =222) =5.807, p =.016]. 
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Table 6 Combined error modification patterns, error-recast and error-recast 
episodes (written test) 



Successful & partial 

Unsuccessful & not modified 

n 

% 

n 

% 

Error-frecast 

67 

72,0 

26 

28.0 

Error-recast 

50 

54.9 

41 

45.1 

Total 

117 

63,6 

67 

36.4 


_ X 2 (1, N =222) =5.807, p =.016 

Note. Percentages represent percentage of episodes in which errors were detected. 


Student modifications on error-recast episodes were also examined in 
greater detail. Table 7 displays the frequency of students' successful and par¬ 
tially successful modifications according to whether they were done in the same 
way as the recast which had been provided or in a different way than the recast 
which had been provided. It was found that for both successful and partially 
successful modifications, students were more likely to modify their utterance in 
the same way as in the recast (82.4% of successful modifications and 66.7% of 
partially successful modifications). However, a chi-square showed that these 
differences did not reach significance [y 2 (1, N =67) =2.176, p =.140], 


Table 7 Same as recast vs. different than recast successful and partially success¬ 
ful modifications on error-recast episodes (written test) 



Same as recast 

Different than recast 


n 

% 

n 

% 

Successful 

28 

82.4 

6 

17,6 

Partially successful 

22 

66.7 

11 

33.3 

Total 

50 

74.6 

17 

25.4 




X 2 (1, N =67) =2.176, p =.140 


Thus, in response to the question of whether errors that received recasts 
would be accurately corrected by students more often than errors that did not 
receive recasts, the results of the written test showed that students' modifica¬ 
tions of their errors on error-recast and error-recast episodes were significantly 
different. Specifically, when successful was combined with partially successful 
and unsuccessful was combined with not modified, it was found that error-re¬ 
cast episodes lead to significantly more successful and partially successful mod¬ 
ifications than error-recast episodes. 
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4.4. Comparison between stimulated correction and written test 

Table 8 showsthe differences in error modification between the stimulated cor¬ 
rection task and the written test. Positive numbers indicate the students had 
more of that type of modification on the written test than on the stimulated 
correction task; negative numbers indicate that the students had fewer of that 
type of modification on the written test than on the stimulated correction task. 


Table 8 Gains/losses in modifications from stimulated correction task to written test 



Successful 


Partial 

Unsuccessful 

Not modified 

n 

% 

n 

% 

n 

% 

n 

% 

Error-frecast 

+12 

+10,7 

+14 

+13.1 

-9 

-12.9 

-9 

-10,8 

Error-recast 

+7 

+5.4 

+13 

+11.6 

46 

+1.3 

-14 

-18.3 

Correct 

+15 

+21.4 

NA 

NA 

+1 

-2.3 

-10 

-19.1 


In general, we see a trend for students to perform more favourably on 
the written test than on the stimulated correction task. However, the effect 
was not even across the three episode types. For example, while error-recast 
episodes had 12.9% fewer unsuccessful answers on the written test than on 
the stimulated correction task, for error-recast and correct episodes the num¬ 
ber of unsuccessful modifications actually increased on the written test (+1.3% 
and +2.3% respectively). 

4.5. Reaction time results 

Reaction time (i.e., time between the end of the video and when the student 
pressed the error or the no-error key) results, which are presented in Table 9, 
show that students were fastest on the correct episodes (2.16076 sec.); stu¬ 
dents were slightly faster on the error-Recast episodes (2.24956 sec.) than on 
the error-recast episodes (2.28148 sec.). However, a one-way ANOVA found 
that the differences in reaction times were not significant [F(2, 330) = .903, p = 
.406]. There was quite a bit of variation in reaction times within each episode 
type, and this may explain why the differences in reaction times between the 
three types of episodes were not significant. 


Table 9 Reaction times in error detection on the stimulated correction task 



M (seconds) 

SD 

M inimum value 

Maximum value 

Error4recast 

2.24956 

0.66770 

0.96560 

4.65070 

Error-recast 

2.28148 

0.70665 

1.20420 

5.40670 

Correct 

2.16076 

0,70545 

0.91940 

4.90460 
p =ns 


Note, ns = nonsignificant 
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Reaction times for error-mecast and error-recast episodes were examined 
in relation to students' modifications of their errors, as shown in Table 10. The 
reaction times for error-mecast and error-recast episodes were very similar when 
the student produced unsuccessful modification or no modification (2.24476 sec. 
and 2.24373 sec. respectively), but were faster on error-mecast episodes when 
students modified their utterance in the same way as in the recast. These results 
suggest that the recasts may have led to faster reaction times for some of the 
episodes, namely those in which the students used the information from the re¬ 
cast to modify their utterance during the error correction task. However, a one¬ 
way ANOVA revealed thatthe differences in reaction times for error-mecast/same 
as recast, error+recast/different than recast, and error-recast episodes were not 
statistically significant [F(2,68) =1.141, p =.326]. 


Table 10 Mean reaction times by nature of modification for error-mecast and 
error-recast episodes 




Successful & partial 

Unsuccessful & not modified 


Same as recast 

Different than recast 

Error-frecast 

2,14991 

2.51741 

2.24476 

Error-re cast 


2.40696 

2.24373 
p =ns 


Note, ns=nonsignificant 


5. Discussion 

The aim of the present study was to investigate whether incidental, extensive re¬ 
casts provided by a teacher in a small group outside a classroom were beneficial 
to adult ESL learners. The results of the error detection task showed a trend for 
students to detect more errors in the error-tfecast episodes than in the error-re¬ 
cast episodes; this may point to a benefit from recasts in terms of students' ability 
to perceive errors in their own speech. However, the difference between error 
detection rates on error-mecast and error-recast episodes failed to reach statisti¬ 
cal significance on either the stimulated correction task or the written test. One 
possible explanation for this was that there seemed to be an overall bias for stu¬ 
dents to think that their speech contained errors, and this bias was found across 
all three episode types, including correct episodes. This bias may have been par¬ 
tially responsible for the high error detection rates on the error-frecast and error- 
recast episodes. These high rates, all over 70%, may have been close to ceiling, 
thus muting any beneficial effect from the recasts and leading to the nonsignifi¬ 
cant difference between the error-frecast and error-recast episodes. 

It was also found that on both the stimulated correction task and the writ¬ 
ten test, students were able to successfully and partially successfully modify 


33 




Laura Hawkes, Hossein Nassaji 

more of their errors from the error-recast episodes than from the error-recast 
episodes. While the difference in modification patterns was a trend in the stim¬ 
ulated correction task, it produced a statistically significant difference on the 
written test. These findings show that recasts seemed to benefit students' abil¬ 
ity to correct errors in their own speech. In addition, there is evidence that the 
specific content of the recasts may have assisted students in the modifications 
of their errors. These results are especially encouraging given that the recasts 
that students received in the present study were spontaneous and extensive. 

As for the students' reaction times, the results show that students re¬ 
sponded slightly more quickly to error-irecast episodes than to error-recasts, 
but this difference was not significant. The fact that students were able to re¬ 
spond more quickly indicates that students were possibly more confident (ei¬ 
ther consciously or unconsciously) in their answers to error-irecast episodes 
than to error-recast episodes. While this trend did not reach significance, this 
was likely due at least in part to the great amount of variability in reaction times 
between students. There was also a trend for students to respond more quickly 
in the error detection task if they later went on to provide a successful or par¬ 
tially successful modification that was the same as the recast they had been 
given during the interaction. It is possible that the recast provided the day be¬ 
fore was stored in a location that was "easily accessible," at least for a short 
period of time after it was provided. 

The study also examined how students responded to correct episodes 
and how these responses compared to the error-irecast and error-recast epi¬ 
sodes. The data from the correct episodes showed that students perceived a 
considerable portion of their correct episodes as containing errors. On the stim¬ 
ulated correction task, students indicated that 47.7% of the correct episodes 
contained errors, while the rate for the written test was even higher, at 53.2%. 
These results indicate that students are often wrong in the assessment of the 
grammaticality of their own utterances, with a tendency to think that their ut¬ 
terances are less grammatical than they are. Results also showed that students 
often modified their grammatical utterances in correct episodes in such a way 
that they became ungrammatical. 

It is also possible that the provision of recasts during the interaction may 
have increased students' belief that they were frequently producing erroneous 
utterances. Since all of the students received a number of recasts during the 
interaction, they may have assumed that they were producing quite a few er¬ 
rors in their speech. This may be particularly true if the students were from 
classrooms where corrective feedback is used infrequently. If this was the case, 
it could explain why students answered "error" so frequently on the error de¬ 
tection task. It would also signify that recasts might produce effects on students 
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beyond the error targeted by the recast. Specifically, the provision of recasts 
may also push students to question the grammaticality of their utterances that 
did not receive recasts. This possibility has yet to be examined by other re¬ 
searchers. In general, recast studies have only examined the effect of recasts on 
those individual erroneous utterances that received recasts (Loewen, 2005; 
Loewen & Philp, 2006; Nassaji, 2006, 2007) or on a target linguistic form (Am- 
mar & Spada, 2006; Dilans, 2010; Ellis et al., 2006; Ishida, 2004; Lyster, 2004; 
M ackey & Philp, 1998; Saito, 2013). 

All of the results of the present study, including those that reached signif¬ 
icance and those that did not, point in the same direction: Recasts were of ben¬ 
efit to students in terms of their ability to detect and correct errors in their own 
speech. Specifically, when presented with errors in their own speech in video 
clips and in written form, students were able to detect, as well as successfully 
and partially successfully modify, more of the errors that had received recasts 
during the interaction than those that had not received recasts. In addition, 
there is some evidence that recasts may have allowed students to detect and 
correct their errors more quickly. Thus, overall, it can be concluded that the re¬ 
casts in the present study were beneficial to students. 

The positive results of this study are particularly noteworthy in two re¬ 
spects. First, the recasts were beneficial even though they were provided in 
small-group (rather than dyadic) interaction. This indicates that recasts can be 
beneficial even when the teacher's attention is divided between several stu¬ 
dents. Secondly, the results of the present study show that recasts can be ben¬ 
eficial to students even when they are provided incidentally and extensively. 
Thus, while previous studies have demonstrated a benefit from intensive recasts 
(Ammar & Spada, 2006; Doughty & Varela, 1998; Ellis at al., 2006; Leeman, 2003; 
Lyster, 2004; M ackey, 2006; M ackey & Philp, 1998; M cDonough & M ackey, 2006; 
Philp, 2003; Saito, 2013, among others), the findings of the present study demon¬ 
strate that recasts do not necessarily need to be provided intensively to be effec¬ 
tive; even a single recast can be of benefit to students. These results confirm Loe¬ 
wen and Philp's (2006) and Nassaji's (2009) findingsthat spontaneous and exten¬ 
sive recasts can benefit students. As teachers often naturally provide incidental, 
extensive recasts in their classes, these results should be seen as especially en¬ 
couraging for both researchers and teachers. Previous studies that have exam¬ 
ined intensive recasts may have made teachers feel that recasts could only be 
beneficial if they were provided intensively. The present study shows that recasts 
can be effective when provided in response to a wide range of linguistic errors, 
even if some linguisticformsreceive only one recast. As such, teachers should not 
be discouraged from incorporating spontaneous, extensive recasts into commu¬ 
nicative-based oral interaction with their students. 
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6. Conclusions and implications 

The present study has incorporated a number of innovative methodological fea¬ 
tures. First, in this study each student was tested on episodes that involved er¬ 
rors in their speech that did receive recasts, episodes that involved errors that 
did not receive any type of feedback, and episodes that did not involve any stu¬ 
dent errors. While a within-subject design may not be necessary for studies of 
recasts directed towards preselected target forms, such methodology might be 
very important in the study of spontaneous, extensive recasts. Previous posttest 
studies of spontaneous, extensive recasts (Loewen, 2005; Loewen & Philp, 2006) 
have not compared their test scores for errors that received recasts with any¬ 
thing. The present study, on the other hand, addresses the need for a control. 

A second innovation of the present study is the use of stimulated correction 
as an instrument to measure learning. Stimulated correction was designed by the 
researchers and, to the best of our knowledge, is the first individualized posttest 
to make use of video clips in the assessment of learning following corrective feed¬ 
back. Stimulated correction has several advantages over traditional written post¬ 
tests, such as allowing students to see and hear the exact context in which the 
errors were made, and forcing students to make their judgments quickly, which 
may increase the chances that students are making use of their implicit knowledge 
to complete the task. This may be important given that it is often assumed that 
recasts are an implicit form of corrective feedback (Goo & M ackey, 2013). 

A final significant contribution of the present study is that it was the first 
study in the area of recasts to examine student reaction time when completing 
posttests. W hile the reaction time results of the present study did not produce sig¬ 
nificant findings, it does not mean that the measurement of reaction time is not 
relevant to the study of corrective feedback. The measurement of reaction time 
has led to significant advancements in knowledge in fields such as psycholinguistics 
and will likely lead to important advancements in knowledge in SLA in the future. 

Despite the contributions of the present study to the field of recast re¬ 
search, there are a number of limitations to the present study. First, the number 
of students who participated in the study was relatively small, and these small 
numbers may have been (at least partially) responsible for the lack of significant 
findings on some of the measures despite clear trends for a benefit from recasts 
on all of the measures. 

Another limitation of this study is that it was conducted in a small-group 
environment (as opposed to a classroom environment). While this likely reflected 
a much more natural situation than that of dyadic studies, we must keep in mind 
that the results cannot necessarily be applied to a whole-class situation. This 
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being said, within L2 classrooms, teachers often divide students into small 
groups to complete tasks, projects, etc. 

Another limitation of the study was the fact that all the students com¬ 
pleted the stimulated correction task before the written test. The primary goal 
of this study was to examine students' responses on the new methodology, 
stimulated correction. Since the number of students in the present study was 
not large, it was decided not to further divide the students into two groups to 
receive the stimulated correction task and the written test in a cross-balanced 
manner. While a practice effect could have been in place on the written test, it 
should be remembered that the purpose of the test was not to see how accurate 
students were on error-recast episodes alone but to compare how accurate they 
were on error-recast and error-recast episodes. It was assumed that any practice 
effect would equally affect the error-recast, error-recast, and correct episodes. 
In addition, since the recasts were cut out of the video clips, the possibility that 
students would learn from hearing the recasts a second time was avoided. 

The present study has answered the research questions it set out to ad¬ 
dress, but, as can be seen, it has also initiated the discussion of several new 
questions. Therefore, a great deal more of research is still needed in this area. 
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APPENDIX 


Transcription conventions 


T 

S 

+ 

«» 

Si 

Underlined 

Bold 

Italics 

Bold-italics 


teacher 
student 
pause 
false start 

extra-lingual information (such as laughing) 
auditory cue 

erroneous student utterance 
recast 

correct student utterance 
student modification 


41 




